Multiple hypothesis testing, adjusting for latent variables

نویسندگان

  • Yunting Sun
  • Nancy Zhang
چکیده

In high throughput settings we inspect a great many candidate variables (e.g. genes) searching for associations with a primary variable (e.g. a phenotype). High throughput hypothesis testing can be made difficult by the presence of systemic effects and other latent variables. It is well known that those variables alter the level of tests and induce correlations between tests. It is less well known that dependencies can change the relative ordering of significance levels among hypotheses. Poor rankings lead to wasteful and ineffective followup studies. The problem becomes acute for latent variables that are correlated with the primary variable. We propose a two stage analysis to counter the effects of latent variables on the ranking of hypotheses. Our method, called LEAPP, statistically isolates the latent variables from the primary one. In simulations it gives better ordering of hypotheses than competing methods such as SVA and EIGENSTRAT. For an illustration, we turn to data from the AGEMAP study relating gene expression to age for 16 tissues in the mouse. LEAPP generates rankings with greater consistency across tissues than the rankings attained by the other methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شناسایی انگیزه‌های انتقال پیام تبلیغاتی در بازاریابی ویروسی (مورد مطالعه: دانشگاه مازندران)

Despite increasing popularity of virus marketing, the factors influencing its suc-cess are not completely identified yet. This study aims to identify message deliv-ery motivations in virus advertisement. This is descriptive survey in which 396 students in the University of Mazandaran were selected by clustering sampling and data were gathered through questionnaire. Material incentive, altruism,...

متن کامل

P 4: The Hypothesis Detect Multiple Sclerosis in Early Stage with Saliva Testing

Introduction: Recent studies point to the clinical and research efficacy of saliva as a respected diagnostic aid for observing Multiple Sclerosis. The objectives of this Hypothesis are to identify novel biomarkers recognized to Multiple Sclerosis in early stage in saliva and to determine if the levels of these markers correlate with level of these Cerebrospinal fluid and blood assays and urine ...

متن کامل

Fuzzy P-values in Latent Variable Problems

We consider the problem of testing a statistical hypothesis where the scientifically meaningful test statistic is a function of latent variables. In particular , we consider detection of genetic linkage, where the latent variables are patterns of inheritance at specific genome locations. Fuzzy p-values, introduced by Geyer & Meeden (2005) are random variables (described by their probability dis...

متن کامل

LINEAR HYPOTHESIS TESTING USING DLR METRIC

Several practical problems of hypotheses testing can be under a general linear model analysis of variance which would be examined. In analysis of variance, when the response random variable Y , has linear relationship with several random variables X, another important model as analysis of covariance can be used. In this paper, assuming that Y is fuzzy and using DLR metric, a method for testing ...

متن کامل

ارائه مدل ساختاری چابکی، مزیت رقابتی، و عملکرد سازمان‌های تولیدی ایران

In our country, government has recently invested many investments in part of Information Technology. Researchers have taken note of the potential efficiency and effectiveness of those investments. IT infrastructure expenditure accounts for over 58 percent of an organization's IT budgets. This research objective is: design a model for effect of flexible IT infrastructure on competitive advantage...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011